Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 5043 |
| Missing cells | 2698 |
| Missing cells (%) | 1.9% |
| Duplicate rows | 45 |
| Duplicate rows (%) | 0.9% |
| Total size in memory | 3.5 MiB |
| Average record size in memory | 718.5 B |
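The two percentages in this table follow directly from the counts. A minimal sketch, assuming that "Missing cells (%)" is taken over all cells (observations × variables) and "Duplicate rows (%)" over all rows; the variable names are illustrative only.

```python
# Counts copied from the dataset statistics table above.
n_variables = 28
n_observations = 5043
missing_cells = 2698
duplicate_rows = 45

# Assumption: percentages are relative to all cells / all rows.
total_cells = n_observations * n_variables
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
```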
Variable types
| NUM | 16 |
|---|---|
| CAT | 11 |
| URL | 1 |
Reproduction
| Analysis started | 2020-03-15 19:50:47.175695 |
|---|---|
| Analysis finished | 2020-03-15 19:52:01.887402 |
| Version | pandas-profiling v2.5.3 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| Dataset has 45 (0.9%) duplicate rows | Duplicates |
| director_name has a high cardinality: 2398 distinct values | High cardinality |
| actor_2_name has a high cardinality: 3032 distinct values | High cardinality |
| genres has a high cardinality: 914 distinct values | High cardinality |
| actor_1_name has a high cardinality: 2097 distinct values | High cardinality |
| movie_title has a high cardinality: 4917 distinct values | High cardinality |
| actor_3_name has a high cardinality: 3521 distinct values | High cardinality |
| plot_keywords has a high cardinality: 4760 distinct values | High cardinality |
| country has a high cardinality: 65 distinct values | High cardinality |
| cast_total_facebook_likes is highly correlated with actor_1_facebook_likes | High Correlation |
| actor_1_facebook_likes is highly correlated with cast_total_facebook_likes | High Correlation |
| director_name has 104 (2.1%) missing values | Missing |
| director_facebook_likes has 104 (2.1%) missing values | Missing |
| gross has 884 (17.5%) missing values | Missing |
| plot_keywords has 153 (3.0%) missing values | Missing |
| content_rating has 303 (6.0%) missing values | Missing |
| budget has 492 (9.8%) missing values | Missing |
| title_year has 108 (2.1%) missing values | Missing |
| aspect_ratio has 329 (6.5%) missing values | Missing |
| budget is highly skewed (γ1 = 48.15743539) | Skewed |
| director_facebook_likes has 907 (18.0%) zeros | Zeros |
| actor_3_facebook_likes has 89 (1.8%) zeros | Zeros |
| facenumber_in_poster has 2152 (42.7%) zeros | Zeros |
| actor_2_facebook_likes has 55 (1.1%) zeros | Zeros |
| movie_facebook_likes has 2181 (43.2%) zeros | Zeros |
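Warnings of this kind can be derived from per-column summaries with simple threshold rules. The sketch below is illustrative only: the function name and both thresholds are assumptions, picked to agree with what is listed here (country with 65 distinct values is flagged while language with 47 is not; budget with skewness 48.16 is flagged while num_voted_users with 4.03 is not). The thresholds actually used are set in config.yaml, which is not reproduced here.

```python
# Hypothetical thresholds, chosen only to agree with the warnings listed
# above; the values actually used come from config.yaml, not shown here.
CARDINALITY_THRESHOLD = 50
SKEWNESS_THRESHOLD = 20.0


def column_warnings(name, distinct=None, skewness=None):
    """Return (message, type) pairs for one column's summary values.

    Pass `distinct` for categorical columns and `skewness` for numeric ones.
    """
    found = []
    if distinct is not None and distinct > CARDINALITY_THRESHOLD:
        found.append(
            (f"{name} has a high cardinality: {distinct} distinct values",
             "High cardinality")
        )
    if skewness is not None and abs(skewness) > SKEWNESS_THRESHOLD:
        found.append((f"{name} is highly skewed (γ1 = {skewness})", "Skewed"))
    return found


# Summary values copied from the report.
print(column_warnings("country", distinct=65))
print(column_warnings("language", distinct=47))
print(column_warnings("budget", skewness=48.15743539))
print(column_warnings("num_voted_users", skewness=4.029871144))
```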
color
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 19 |
| Missing (%) | 0.4% |
| Memory size | 19.8 KiB |
| Color | 4815 |
|---|---|
| Black and White | 209 |
| Value | Count | Frequency (%) | |
| Color | 4815 | 95.5% | |
| Black and White | 209 | 4.1% | |
| (Missing) | 19 | 0.4% |
Length
| Max length | 16 |
|---|---|
| Mean length | 5.44834424 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 12 | 75.0% | |
| Uppercase_Letter | 3 | 18.8% | |
| Space_Separator | 1 | 6.2% |
| Value | Count | Frequency (%) | |
| Latin | 15 | 93.8% | |
| Common | 1 | 6.2% |
| Value | Count | Frequency (%) | |
| ASCII | 16 | 100.0% |
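The three small tables above appear to count the distinct characters used in this column by Unicode general category, script, and block. The sketch below reproduces only the category counts with the standard library. It assumes the counts are over distinct characters in the two observed values, which is an inference from the totals and not something the report states. Leading or trailing spaces in the raw values would not change the result, since a space is already counted once.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in these values.
CATEGORY_NAMES = {
    "Ll": "Lowercase_Letter",
    "Lu": "Uppercase_Letter",
    "Zs": "Space_Separator",
}

values = ["Color", "Black and White"]
distinct_chars = set("".join(values))

# Count distinct characters per category, falling back to the short code.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in distinct_chars
)

for name, count in category_counts.most_common():
    share = 100 * count / len(distinct_chars)
    print(f"{name}: {count} ({share:.1f}%)")
```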
director_name
Categorical
| Distinct count | 2398 |
|---|---|
| Unique (%) | 48.6% |
| Missing | 104 |
| Missing (%) | 2.1% |
| Memory size | 19.8 KiB |
| Steven Spielberg | 26 |
|---|---|
| Woody Allen | 22 |
| Clint Eastwood | 20 |
| Martin Scorsese | 20 |
| Ridley Scott | 17 |
| Other values (2393) | 4834 |
| Value | Count | Frequency (%) | |
| Steven Spielberg | 26 | 0.5% | |
| Woody Allen | 22 | 0.4% | |
| Clint Eastwood | 20 | 0.4% | |
| Martin Scorsese | 20 | 0.4% | |
| Ridley Scott | 17 | 0.3% | |
| Steven Soderbergh | 16 | 0.3% | |
| Tim Burton | 16 | 0.3% | |
| Spike Lee | 16 | 0.3% | |
| Renny Harlin | 15 | 0.3% | |
| Oliver Stone | 14 | 0.3% | |
| Other values (2388) | 4757 | 94.3% | |
| (Missing) | 104 | 2.1% |
Length
| Max length | 32 |
|---|---|
| Mean length | 12.87685901 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 41 | 53.9% | |
| Uppercase_Letter | 31 | 40.8% | |
| Other_Punctuation | 2 | 2.6% | |
| Dash_Punctuation | 1 | 1.3% | |
| Space_Separator | 1 | 1.3% |
| Value | Count | Frequency (%) | |
| Latin | 72 | 94.7% | |
| Common | 4 | 5.3% |
| Value | Count | Frequency (%) | |
| ASCII | 56 | 100.0% |
num_critic_for_reviews
Real number (ℝ≥0)
| Distinct count | 528 |
|---|---|
| Unique (%) | 10.6% |
| Missing | 50 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 140.1942719807731 |
|---|---|
| Minimum | 1.0 |
| Maximum | 813.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 50 |
| median | 110 |
| Q3 | 195 |
| 95-th percentile | 387 |
| Maximum | 813 |
| Range | 812 |
| Interquartile range (IQR) | 145 |
Descriptive statistics
| Standard deviation | 121.6016754 |
|---|---|
| Coefficient of variation (CV) | 0.8673797701 |
| Kurtosis | 2.91341641 |
| Mean | 140.194272 |
| Mean Absolute Deviation (MAD) | 92.35207408 |
| Skewness | 1.5165327 |
| Sum | 699990 |
| Variance | 14786.96746 |
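Several of these statistics are functions of the others, which gives a quick consistency check. A minimal sketch, assuming the number of valid values is the number of observations minus the missing count (5043 − 50); the variable names are illustrative.

```python
# Values copied from the tables above.
n_observations = 5043
n_missing = 50
mean = 140.1942719807731
std = 121.6016754
q1, q3 = 50, 195
minimum, maximum = 1, 813

n_valid = n_observations - n_missing   # rows with a value present
cv = std / mean                        # coefficient of variation
variance = std ** 2
total = mean * n_valid                 # sum of all valid values
value_range = maximum - minimum
iqr = q3 - q1

print(f"CV: {cv:.10f}")
print(f"Variance: {variance:.5f}")
print(f"Sum: {total:.0f}")
print(f"Range: {value_range}, IQR: {iqr}")
```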
| Value | Count | Frequency (%) | |
| 1 | 43 | 0.9% | |
| 9 | 37 | 0.7% | |
| 5 | 36 | 0.7% | |
| 10 | 35 | 0.7% | |
| 8 | 35 | 0.7% | |
| 12 | 34 | 0.7% | |
| 81 | 33 | 0.7% | |
| 16 | 33 | 0.7% | |
| 43 | 31 | 0.6% | |
| 29 | 30 | 0.6% | |
| Other values (518) | 4646 | 92.1% | |
| (Missing) | 50 | 1.0% |
| Value | Count | Frequency (%) | |
| 1 | 43 | 0.9% | |
| 2 | 26 | 0.5% | |
| 3 | 24 | 0.5% | |
| 4 | 29 | 0.6% | |
| 5 | 36 | 0.7% |
| Value | Count | Frequency (%) | |
| 813 | 1 | < 0.1% | |
| 775 | 1 | < 0.1% | |
| 765 | 1 | < 0.1% | |
| 750 | 2 | < 0.1% | |
| 739 | 1 | < 0.1% |
duration
Real number (ℝ≥0)
| Distinct count | 191 |
|---|---|
| Unique (%) | 3.8% |
| Missing | 15 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 107.2010739856802 |
|---|---|
| Minimum | 7.0 |
| Maximum | 511.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 81 |
| Q1 | 93 |
| median | 103 |
| Q3 | 118 |
| 95-th percentile | 146 |
| Maximum | 511 |
| Range | 504 |
| Interquartile range (IQR) | 25 |
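The quartiles above allow a rule-of-thumb outlier screen. The sketch below applies Tukey's 1.5 × IQR fences, which is a common convention and not something this report computes; the function name is illustrative.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Q1 and Q3 copied from the quantile table above.
lower, upper = tukey_fences(93, 118)
print(f"fences: [{lower}, {upper}]")

# Compare a few reported quantiles against the fences.
for label, value in [("minimum", 7), ("95-th percentile", 146), ("maximum", 511)]:
    status = "inside" if lower <= value <= upper else "outside"
    print(f"{label} = {value}: {status}")
```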
Descriptive statistics
| Standard deviation | 25.19744081 |
|---|---|
| Coefficient of variation (CV) | 0.235048399 |
| Kurtosis | 22.56579716 |
| Mean | 107.201074 |
| Mean Absolute Deviation (MAD) | 16.81590041 |
| Skewness | 2.339134041 |
| Sum | 539007 |
| Variance | 634.9110233 |
| Value | Count | Frequency (%) | |
| 90 | 161 | 3.2% | |
| 100 | 141 | 2.8% | |
| 101 | 139 | 2.8% | |
| 98 | 135 | 2.7% | |
| 97 | 131 | 2.6% | |
| 93 | 129 | 2.6% | |
| 95 | 124 | 2.5% | |
| 99 | 124 | 2.5% | |
| 94 | 124 | 2.5% | |
| 96 | 113 | 2.2% | |
| Other values (181) | 3707 | 73.5% |
| Value | Count | Frequency (%) | |
| 7 | 2 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 14 | 1 | < 0.1% | |
| 20 | 1 | < 0.1% | |
| 22 | 7 | 0.1% |
| Value | Count | Frequency (%) | |
| 511 | 1 | < 0.1% | |
| 334 | 1 | < 0.1% | |
| 330 | 1 | < 0.1% | |
| 325 | 1 | < 0.1% | |
| 300 | 1 | < 0.1% |
director_facebook_likes
Real number (ℝ≥0)
| Distinct count | 435 |
|---|---|
| Unique (%) | 8.8% |
| Missing | 104 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 686.5092123911724 |
|---|---|
| Minimum | 0.0 |
| Maximum | 23000.0 |
| Zeros | 907 |
| Zeros (%) | 18.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 49 |
| Q3 | 194.5 |
| 95-th percentile | 973 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 187.5 |
Descriptive statistics
| Standard deviation | 2813.328607 |
|---|---|
| Coefficient of variation (CV) | 4.098020181 |
| Kurtosis | 27.25628935 |
| Mean | 686.5092124 |
| Mean Absolute Deviation (MAD) | 1069.818414 |
| Skewness | 5.22970117 |
| Sum | 3390669 |
| Variance | 7914817.85 |
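The MAD value above (1069.818414) is far larger than the quartiles would allow for a median absolute deviation. At least about half of the values lie between Q1 and Q3, so the median of |x − median| cannot exceed max(median − Q1, Q3 − median), up to quantile interpolation effects. The sketch below checks that bound and contrasts the two definitions on a small made-up list. Reading the reported figure as a mean absolute deviation is an inference from these numbers, and the helper names are illustrative.

```python
from statistics import mean, median


def median_abs_dev_upper_bound(q1, med, q3):
    """Upper bound on the median absolute deviation implied by the quartiles."""
    return max(med - q1, q3 - med)


def median_abs_deviation(xs):
    m = median(xs)
    return median(abs(x - m) for x in xs)


def mean_abs_deviation(xs):
    mu = mean(xs)
    return mean(abs(x - mu) for x in xs)


# Q1, median, Q3 and MAD copied from the tables above.
bound = median_abs_dev_upper_bound(7, 49, 194.5)
reported_mad = 1069.818414
print(f"bound: {bound}, reported: {reported_mad}, exceeds bound: {reported_mad > bound}")

# Made-up, heavy-tailed toy data (not taken from the dataset).
toy = [0, 0, 7, 49, 194, 973, 23000]
print(median_abs_deviation(toy), mean_abs_deviation(toy))
```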
| Value | Count | Frequency (%) | |
| 0 | 907 | 18.0% | |
| 3 | 70 | 1.4% | |
| 6 | 66 | 1.3% | |
| 7 | 64 | 1.3% | |
| 2 | 63 | 1.2% | |
| 4 | 60 | 1.2% | |
| 11 | 59 | 1.2% | |
| 10 | 53 | 1.1% | |
| 8 | 52 | 1.0% | |
| 5 | 52 | 1.0% | |
| Other values (425) | 3493 | 69.3% | |
| (Missing) | 104 | 2.1% |
| Value | Count | Frequency (%) | |
| 0 | 907 | 18.0% | |
| 2 | 63 | 1.2% | |
| 3 | 70 | 1.4% | |
| 4 | 60 | 1.2% | |
| 5 | 52 | 1.0% |
| Value | Count | Frequency (%) | |
| 23000 | 1 | < 0.1% | |
| 22000 | 8 | 0.2% | |
| 21000 | 10 | 0.2% | |
| 20000 | 1 | < 0.1% | |
| 18000 | 4 | 0.1% |
actor_3_facebook_likes
Real number (ℝ≥0)
| Distinct count | 906 |
|---|---|
| Unique (%) | 18.0% |
| Missing | 23 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 645.0097609561753 |
|---|---|
| Minimum | 0.0 |
| Maximum | 23000.0 |
| Zeros | 89 |
| Zeros (%) | 1.8% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 133 |
| median | 371.5 |
| Q3 | 636 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 503 |
Descriptive statistics
| Standard deviation | 1665.041728 |
|---|---|
| Coefficient of variation (CV) | 2.581420979 |
| Kurtosis | 60.56388811 |
| Mean | 645.009761 |
| Mean Absolute Deviation (MAD) | 569.3467201 |
| Skewness | 7.279020793 |
| Sum | 3237949 |
| Variance | 2772363.957 |
| Value | Count | Frequency (%) | |
| 1000 | 126 | 2.5% | |
| 0 | 89 | 1.8% | |
| 11000 | 29 | 0.6% | |
| 3 | 28 | 0.6% | |
| 2000 | 27 | 0.5% | |
| 3000 | 26 | 0.5% | |
| 826 | 22 | 0.4% | |
| 2 | 21 | 0.4% | |
| 4 | 21 | 0.4% | |
| 7 | 21 | 0.4% | |
| Other values (896) | 4610 | 91.4% | |
| (Missing) | 23 | 0.5% |
| Value | Count | Frequency (%) | |
| 0 | 89 | 1.8% | |
| 2 | 21 | 0.4% | |
| 3 | 28 | 0.6% | |
| 4 | 21 | 0.4% | |
| 5 | 18 | 0.4% |
| Value | Count | Frequency (%) | |
| 23000 | 2 | < 0.1% | |
| 20000 | 1 | < 0.1% | |
| 19000 | 5 | 0.1% | |
| 17000 | 1 | < 0.1% | |
| 16000 | 3 | 0.1% |
actor_2_name
Categorical
| Distinct count | 3032 |
|---|---|
| Unique (%) | 60.3% |
| Missing | 13 |
| Missing (%) | 0.3% |
| Memory size | 19.8 KiB |
| Morgan Freeman | 20 |
|---|---|
| Charlize Theron | 15 |
| Brad Pitt | 14 |
| Meryl Streep | 11 |
| James Franco | 11 |
| Other values (3027) | 4959 |
| Value | Count | Frequency (%) | |
| Morgan Freeman | 20 | 0.4% | |
| Charlize Theron | 15 | 0.3% | |
| Brad Pitt | 14 | 0.3% | |
| Meryl Streep | 11 | 0.2% | |
| James Franco | 11 | 0.2% | |
| Adam Sandler | 10 | 0.2% | |
| Jason Flemyng | 10 | 0.2% | |
| Angelina Jolie Pitt | 9 | 0.2% | |
| Thomas Kretschmann | 9 | 0.2% | |
| Steve Buscemi | 9 | 0.2% | |
| Other values (3022) | 4912 | 97.4% | |
| (Missing) | 13 | 0.3% |
Length
| Max length | 28 |
|---|---|
| Mean length | 13.0483839 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 48 | 60.0% | |
| Uppercase_Letter | 26 | 32.5% | |
| Decimal_Number | 2 | 2.5% | |
| Other_Punctuation | 2 | 2.5% | |
| Dash_Punctuation | 1 | 1.2% | |
| Space_Separator | 1 | 1.2% |
| Value | Count | Frequency (%) | |
| Latin | 74 | 92.5% | |
| Common | 6 | 7.5% |
| Value | Count | Frequency (%) | |
| ASCII | 58 | 100.0% |
actor_1_facebook_likes
Real number (ℝ≥0)
| Distinct count | 878 |
|---|---|
| Unique (%) | 17.4% |
| Missing | 7 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6560.04706115965 |
|---|---|
| Minimum | 0.0 |
| Maximum | 640000.0 |
| Zeros | 26 |
| Zeros (%) | 0.5% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 95.5 |
| Q1 | 614 |
| median | 988 |
| Q3 | 11000 |
| 95-th percentile | 24000 |
| Maximum | 640000 |
| Range | 640000 |
| Interquartile range (IQR) | 10386 |
Descriptive statistics
| Standard deviation | 15020.75912 |
|---|---|
| Coefficient of variation (CV) | 2.289733439 |
| Kurtosis | 683.5473559 |
| Mean | 6560.047061 |
| Mean Absolute Deviation (MAD) | 7727.675203 |
| Skewness | 19.12177638 |
| Sum | 33036397 |
| Variance | 225623204.5 |
| Value | Count | Frequency (%) | |
| 1000 | 449 | 8.9% | |
| 11000 | 211 | 4.2% | |
| 2000 | 197 | 3.9% | |
| 3000 | 155 | 3.1% | |
| 12000 | 135 | 2.7% | |
| 13000 | 127 | 2.5% | |
| 14000 | 123 | 2.4% | |
| 10000 | 112 | 2.2% | |
| 18000 | 109 | 2.2% | |
| 22000 | 82 | 1.6% | |
| Other values (868) | 3336 | 66.2% |
| Value | Count | Frequency (%) | |
| 0 | 26 | 0.5% | |
| 2 | 8 | 0.2% | |
| 3 | 4 | 0.1% | |
| 4 | 2 | < 0.1% | |
| 5 | 7 | 0.1% |
| Value | Count | Frequency (%) | |
| 640000 | 1 | < 0.1% | |
| 260000 | 3 | 0.1% | |
| 164000 | 2 | < 0.1% | |
| 137000 | 2 | < 0.1% | |
| 87000 | 8 | 0.2% |
gross
Real number (ℝ≥0)
| Distinct count | 4035 |
|---|---|
| Unique (%) | 97.0% |
| Missing | 884 |
| Missing (%) | 17.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48468407.52680933 |
|---|---|
| Minimum | 162.0 |
| Maximum | 760505847.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 99034 |
| Q1 | 5340987.5 |
| median | 25517500 |
| Q3 | 62309437.5 |
| 95-th percentile | 180029729.4 |
| Maximum | 760505847 |
| Range | 760505685 |
| Interquartile range (IQR) | 56968450 |
Descriptive statistics
| Standard deviation | 68452990.44 |
|---|---|
| Coefficient of variation (CV) | 1.412321839 |
| Kurtosis | 14.86886885 |
| Mean | 48468407.53 |
| Mean Absolute Deviation (MAD) | 45141337.64 |
| Skewness | 3.127203838 |
| Sum | 2.015801069e+11 |
| Variance | 4.6858119e+15 |
| Value | Count | Frequency (%) | |
| 144512310 | 3 | 0.1% | |
| 5773519 | 3 | 0.1% | |
| 177343675 | 3 | 0.1% | |
| 34964818 | 3 | 0.1% | |
| 47000000 | 3 | 0.1% | |
| 218051260 | 3 | 0.1% | |
| 3000000 | 3 | 0.1% | |
| 8000000 | 3 | 0.1% | |
| 800000 | 2 | < 0.1% | |
| 22494487 | 2 | < 0.1% | |
| Other values (4025) | 4131 | 81.9% | |
| (Missing) | 884 | 17.5% |
| Value | Count | Frequency (%) | |
| 162 | 1 | < 0.1% | |
| 703 | 1 | < 0.1% | |
| 721 | 1 | < 0.1% | |
| 728 | 1 | < 0.1% | |
| 828 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 760505847 | 1 | < 0.1% | |
| 658672302 | 1 | < 0.1% | |
| 652177271 | 1 | < 0.1% | |
| 623279547 | 2 | < 0.1% | |
| 533316061 | 1 | < 0.1% |
genres
Categorical
| Distinct count | 914 |
|---|---|
| Unique (%) | 18.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.8 KiB |
| Drama | 236 |
|---|---|
| Comedy | 209 |
| Comedy\|Drama | 191 |
| Comedy\|Drama\|Romance | 187 |
| Comedy\|Romance | 158 |
| Other values (909) | 4062 |
| Value | Count | Frequency (%) | |
| Drama | 236 | 4.7% | |
| Comedy | 209 | 4.1% | |
| Comedy\|Drama | 191 | 3.8% | |
| Comedy\|Drama\|Romance | 187 | 3.7% | |
| Comedy\|Romance | 158 | 3.1% | |
| Drama\|Romance | 152 | 3.0% | |
| Crime\|Drama\|Thriller | 101 | 2.0% | |
| Horror | 71 | 1.4% | |
| Action\|Crime\|Drama\|Thriller | 68 | 1.3% | |
| Action\|Crime\|Thriller | 65 | 1.3% | |
| Other values (904) | 3605 | 71.5% |
Length
| Max length | 64 |
|---|---|
| Mean length | 20.31310728 |
| Min length | 5 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 19 | 54.3% | |
| Uppercase_Letter | 14 | 40.0% | |
| Dash_Punctuation | 1 | 2.9% | |
| Math_Symbol | 1 | 2.9% |
| Value | Count | Frequency (%) | |
| Latin | 33 | 94.3% | |
| Common | 2 | 5.7% |
| Value | Count | Frequency (%) | |
| ASCII | 35 | 100.0% |
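The values in this column are pipe-delimited genre lists, which explains the large number of distinct combinations. Splitting on the pipe character gives per-genre counts instead. The sketch below uses only the ten most common combinations and their counts as listed above, so the totals cover those rows and not the full column.

```python
from collections import Counter

# Ten most common combinations and their counts, copied from the table above.
top_combinations = {
    "Drama": 236,
    "Comedy": 209,
    "Comedy|Drama": 191,
    "Comedy|Drama|Romance": 187,
    "Comedy|Romance": 158,
    "Drama|Romance": 152,
    "Crime|Drama|Thriller": 101,
    "Horror": 71,
    "Action|Crime|Drama|Thriller": 68,
    "Action|Crime|Thriller": 65,
}

# Credit every genre in a combination with that combination's row count.
genre_counts = Counter()
for combination, n_rows in top_combinations.items():
    for genre in combination.split("|"):
        genre_counts[genre] += n_rows

for genre, n_rows in genre_counts.most_common():
    print(f"{genre}: {n_rows}")
```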
actor_1_name
Categorical
| Distinct count | 2097 |
|---|---|
| Unique (%) | 41.6% |
| Missing | 7 |
| Missing (%) | 0.1% |
| Memory size | 19.8 KiB |
| Robert De Niro | 49 |
|---|---|
| Johnny Depp | 41 |
| Nicolas Cage | 33 |
| J.K. Simmons | 31 |
| Bruce Willis | 30 |
| Other values (2092) | 4852 |
| Value | Count | Frequency (%) | |
| Robert De Niro | 49 | 1.0% | |
| Johnny Depp | 41 | 0.8% | |
| Nicolas Cage | 33 | 0.7% | |
| J.K. Simmons | 31 | 0.6% | |
| Bruce Willis | 30 | 0.6% | |
| Matt Damon | 30 | 0.6% | |
| Denzel Washington | 30 | 0.6% | |
| Liam Neeson | 29 | 0.6% | |
| Steve Buscemi | 27 | 0.5% | |
| Robin Williams | 27 | 0.5% | |
| Other values (2087) | 4709 | 93.4% |
Length
| Max length | 27 |
|---|---|
| Mean length | 13.1782669 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 42 | 55.3% | |
| Uppercase_Letter | 28 | 36.8% | |
| Decimal_Number | 2 | 2.6% | |
| Other_Punctuation | 2 | 2.6% | |
| Dash_Punctuation | 1 | 1.3% | |
| Space_Separator | 1 | 1.3% |
| Value | Count | Frequency (%) | |
| Latin | 70 | 92.1% | |
| Common | 6 | 7.9% |
| Value | Count | Frequency (%) | |
| ASCII | 58 | 100.0% |
movie_title
Categorical
| Distinct count | 4917 |
|---|---|
| Unique (%) | 97.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.8 KiB |
| Pan | 3 |
|---|---|
| Victor Frankenstein | 3 |
| The Fast and the Furious | 3 |
| Home | 3 |
| Halloween | 3 |
| Other values (4912) | 5028 |
| Value | Count | Frequency (%) | |
| Pan | 3 | 0.1% | |
| Victor Frankenstein | 3 | 0.1% | |
| The Fast and the Furious | 3 | 0.1% | |
| Home | 3 | 0.1% | |
| Halloween | 3 | 0.1% | |
| King Kong | 3 | 0.1% | |
| Ben-Hur | 3 | 0.1% | |
| Godzilla Resurgence | 2 | < 0.1% | |
| The Texas Chain Saw Massacre | 2 | < 0.1% | |
| Dodgeball: A True Underdog Story | 2 | < 0.1% | |
| Other values (4907) | 5016 | 99.5% |
Length
| Max length | 87 |
|---|---|
| Mean length | 16.54967281 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 35 | 36.1% | |
| Uppercase_Letter | 27 | 27.8% | |
| Other_Punctuation | 12 | 12.4% | |
| Decimal_Number | 10 | 10.3% | |
| Open_Punctuation | 2 | 2.1% | |
| Space_Separator | 2 | 2.1% | |
| Currency_Symbol | 2 | 2.1% | |
| Close_Punctuation | 2 | 2.1% | |
| Dash_Punctuation | 1 | 1.0% | |
| Other_Symbol | 1 | 1.0% | |
| Other values (3) | 3 | 3.1% |
| Value | Count | Frequency (%) | |
| Latin | 62 | 63.9% | |
| Common | 35 | 36.1% |
| Value | Count | Frequency (%) | |
| ASCII | 82 | 100.0% |
num_voted_users
Real number (ℝ≥0)
| Distinct count | 4826 |
|---|---|
| Unique (%) | 95.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83668.16081697402 |
|---|---|
| Minimum | 5 |
| Maximum | 1689764 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 514.6 |
| Q1 | 8593.5 |
| median | 34359 |
| Q3 | 96309 |
| 95-th percentile | 332254.9 |
| Maximum | 1689764 |
| Range | 1689759 |
| Interquartile range (IQR) | 87715.5 |
Descriptive statistics
| Standard deviation | 138485.2568 |
|---|---|
| Coefficient of variation (CV) | 1.655172714 |
| Kurtosis | 24.44552017 |
| Mean | 83668.16082 |
| Mean Absolute Deviation (MAD) | 84252.04372 |
| Skewness | 4.029871144 |
| Sum | 421938535 |
| Variance | 1.917816635e+10 |
| Value | Count | Frequency (%) | |
| 57 | 5 | 0.1% | |
| 6 | 4 | 0.1% | |
| 6025 | 3 | 0.1% | |
| 374 | 3 | 0.1% | |
| 53 | 3 | 0.1% | |
| 3119 | 3 | 0.1% | |
| 62 | 3 | 0.1% | |
| 162 | 3 | 0.1% | |
| 2541 | 3 | 0.1% | |
| 8 | 3 | 0.1% | |
| Other values (4816) | 5010 | 99.3% |
| Value | Count | Frequency (%) | |
| 5 | 2 | < 0.1% | |
| 6 | 4 | 0.1% | |
| 7 | 2 | < 0.1% | |
| 8 | 3 | 0.1% | |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1689764 | 1 | < 0.1% | |
| 1676169 | 1 | < 0.1% | |
| 1468200 | 1 | < 0.1% | |
| 1347461 | 1 | < 0.1% | |
| 1324680 | 1 | < 0.1% |
cast_total_facebook_likes
Real number (ℝ≥0)
| Distinct count | 3978 |
|---|---|
| Unique (%) | 78.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9699.06385088241 |
|---|---|
| Minimum | 0 |
| Maximum | 656730 |
| Zeros | 33 |
| Zeros (%) | 0.7% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 179 |
| Q1 | 1411 |
| median | 3090 |
| Q3 | 13756.5 |
| 95-th percentile | 36927.7 |
| Maximum | 656730 |
| Range | 656730 |
| Interquartile range (IQR) | 12345.5 |
Descriptive statistics
| Standard deviation | 18163.79912 |
|---|---|
| Coefficient of variation (CV) | 1.872737349 |
| Kurtosis | 361.2551153 |
| Mean | 9699.063851 |
| Mean Absolute Deviation (MAD) | 10152.51874 |
| Skewness | 12.83192773 |
| Sum | 48912379 |
| Variance | 329923598.6 |
| Value | Count | Frequency (%) | |
| 0 | 33 | 0.7% | |
| 5 | 7 | 0.1% | |
| 2020 | 6 | 0.1% | |
| 2 | 6 | 0.1% | |
| 1044 | 5 | 0.1% | |
| 673 | 5 | 0.1% | |
| 29 | 5 | 0.1% | |
| 2321 | 4 | 0.1% | |
| 1554 | 4 | 0.1% | |
| 646 | 4 | 0.1% | |
| Other values (3968) | 4964 | 98.4% |
| Value | Count | Frequency (%) | |
| 0 | 33 | 0.7% | |
| 2 | 6 | 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| 5 | 7 | 0.1% |
| Value | Count | Frequency (%) | |
| 656730 | 1 | < 0.1% | |
| 303717 | 1 | < 0.1% | |
| 283939 | 1 | < 0.1% | |
| 263584 | 1 | < 0.1% | |
| 261818 | 1 | < 0.1% |
actor_3_name
Categorical
| Distinct count | 3521 |
|---|---|
| Unique (%) | 70.1% |
| Missing | 23 |
| Missing (%) | 0.5% |
| Memory size | 19.8 KiB |
| John Heard | 8 |
|---|---|
| Ben Mendelsohn | 8 |
| Steve Coogan | 8 |
| Kirsten Dunst | 7 |
| Sam Shepard | 7 |
| Other values (3516) | 4982 |
| Value | Count | Frequency (%) | |
| John Heard | 8 | 0.2% | |
| Ben Mendelsohn | 8 | 0.2% | |
| Steve Coogan | 8 | 0.2% | |
| Kirsten Dunst | 7 | 0.1% | |
| Sam Shepard | 7 | 0.1% | |
| Anne Hathaway | 7 | 0.1% | |
| Jon Gries | 7 | 0.1% | |
| Stephen Root | 7 | 0.1% | |
| Lois Maxwell | 7 | 0.1% | |
| Robert Duvall | 7 | 0.1% | |
| Other values (3511) | 4947 | 98.1% | |
| (Missing) | 23 | 0.5% |
Length
| Max length | 29 |
|---|---|
| Mean length | 13.03628792 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 44 | 54.3% | |
| Uppercase_Letter | 31 | 38.3% | |
| Decimal_Number | 2 | 2.5% | |
| Other_Punctuation | 2 | 2.5% | |
| Dash_Punctuation | 1 | 1.2% | |
| Space_Separator | 1 | 1.2% |
| Value | Count | Frequency (%) | |
| Latin | 75 | 92.6% | |
| Common | 6 | 7.4% |
| Value | Count | Frequency (%) | |
| ASCII | 58 | 100.0% |
facenumber_in_poster
Real number (ℝ≥0)
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 13 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3711729622266402 |
|---|---|
| Minimum | 0.0 |
| Maximum | 43.0 |
| Zeros | 2152 |
| Zeros (%) | 42.7% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.01357592 |
|---|---|
| Coefficient of variation (CV) | 1.468506144 |
| Kurtosis | 52.03373533 |
| Mean | 1.371172962 |
| Mean Absolute Deviation (MAD) | 1.357893277 |
| Skewness | 4.384765939 |
| Sum | 6897 |
| Variance | 4.054487986 |
| Value | Count | Frequency (%) | |
| 0 | 2152 | 42.7% | |
| 1 | 1251 | 24.8% | |
| 2 | 716 | 14.2% | |
| 3 | 380 | 7.5% | |
| 4 | 207 | 4.1% | |
| 5 | 114 | 2.3% | |
| 6 | 76 | 1.5% | |
| 7 | 48 | 1.0% | |
| 8 | 37 | 0.7% | |
| 9 | 18 | 0.4% | |
| Other values (9) | 31 | 0.6% | |
| (Missing) | 13 | 0.3% |
| Value | Count | Frequency (%) | |
| 0 | 2152 | 42.7% | |
| 1 | 1251 | 24.8% | |
| 2 | 716 | 14.2% | |
| 3 | 380 | 7.5% | |
| 4 | 207 | 4.1% |
| Value | Count | Frequency (%) | |
| 43 | 1 | < 0.1% | |
| 31 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 15 | 6 | 0.1% | |
| 14 | 1 | < 0.1% |
plot_keywords
Categorical
| Distinct count | 4760 |
|---|---|
| Unique (%) | 97.3% |
| Missing | 153 |
| Missing (%) | 3.0% |
| Memory size | 19.8 KiB |
| based on novel | 4 |
|---|---|
| animal name in title\|ape abducts a woman\|gorilla\|island\|king kong | 3 |
| 1940s\|child hero\|fantasy world\|orphan\|reference to peter pan | 3 |
| alien friendship\|alien invasion\|australia\|flying car\|mother daughter relationship | 3 |
| halloween\|masked killer\|michael myers\|slasher\|trick or treat | 3 |
| Other values (4755) | 4874 |
| Value | Count | Frequency (%) | |
| based on novel | 4 | 0.1% | |
| animal name in title\|ape abducts a woman\|gorilla\|island\|king kong | 3 | 0.1% | |
| 1940s\|child hero\|fantasy world\|orphan\|reference to peter pan | 3 | 0.1% | |
| alien friendship\|alien invasion\|australia\|flying car\|mother daughter relationship | 3 | 0.1% | |
| halloween\|masked killer\|michael myers\|slasher\|trick or treat | 3 | 0.1% | |
| eighteen wheeler\|illegal street racing\|truck\|trucker\|undercover cop | 3 | 0.1% | |
| one word title | 3 | 0.1% | |
| assistant\|experiment\|frankenstein\|medical student\|scientist | 3 | 0.1% | |
| race relations\|racism\|racist\|social problem\|stereotype | 2 | < 0.1% | |
| casino\|espionage\|free running\|james bond\|terrorist | 2 | < 0.1% | |
| Other values (4750) | 4861 | 96.4% | |
| (Missing) | 153 | 3.0% |
Length
| Max length | 149 |
|---|---|
| Mean length | 50.93337299 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 26 | 61.9% | |
| Decimal_Number | 10 | 23.8% | |
| Other_Punctuation | 2 | 4.8% | |
| Open_Punctuation | 1 | 2.4% | |
| Space_Separator | 1 | 2.4% | |
| Math_Symbol | 1 | 2.4% | |
| Close_Punctuation | 1 | 2.4% |
| Value | Count | Frequency (%) | |
| Latin | 26 | 61.9% | |
| Common | 16 | 38.1% |
| Value | Count | Frequency (%) | |
| ASCII | 42 | 100.0% |
movie_imdb_link
URL
| Distinct count | 4919 |
|---|---|
| Unique (%) | 97.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.8 KiB |
| http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1 | 3 |
|---|---|
| http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 | 3 |
| http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 | 3 |
| http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 | 3 |
| http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 | 3 |
| Other values (4914) | 5028 |
| Value | Count | Frequency (%) | |
| http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt2638144/?ref_=fn_tt_tt_1 | 3 | 0.1% | |
| http://www.imdb.com/title/tt0083722/?ref_=fn_tt_tt_1 | 2 | < 0.1% | |
| http://www.imdb.com/title/tt0844708/?ref_=fn_tt_tt_1 | 2 | < 0.1% | |
| http://www.imdb.com/title/tt1666335/?ref_=fn_tt_tt_1 | 2 | < 0.1% | |
| Other values (4909) | 5016 | 99.5% |
| Value | Count | Frequency (%) | |
| http | 5043 | 100.0% |
| Value | Count | Frequency (%) | |
| www.imdb.com | 5043 | 100.0% |
| Value | Count | Frequency (%) | |
| /title/tt3332064/ | 3 | 0.1% | |
| /title/tt2638144/ | 3 | 0.1% | |
| /title/tt0232500/ | 3 | 0.1% | |
| /title/tt2224026/ | 3 | 0.1% | |
| /title/tt1976009/ | 3 | 0.1% | |
| /title/tt0077651/ | 3 | 0.1% | |
| /title/tt0360717/ | 3 | 0.1% | |
| /title/tt0844708/ | 2 | < 0.1% | |
| /title/tt0138304/ | 2 | < 0.1% | |
| /title/tt0795368/ | 2 | < 0.1% | |
| Other values (4909) | 5016 | 99.5% |
| Value | Count | Frequency (%) | |
| ref_=fn_tt_tt_1 | 5043 | 100.0% |
| Value | Count | Frequency (%) | |
| (empty) | 5043 | 100.0% |
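The five tables above line up with the standard components of a URL: scheme, network location, path, query, and fragment, where the fragment is empty for every row. A minimal sketch with the standard library, applied to one URL from the list above; whether the report splits URLs in exactly this way is an assumption.

```python
from urllib.parse import urlsplit

# One of the URLs listed above.
url = "http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1"
parts = urlsplit(url)

print("scheme:  ", parts.scheme)
print("netloc:  ", parts.netloc)
print("path:    ", parts.path)
print("query:   ", parts.query)
print("fragment:", repr(parts.fragment))
```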
num_user_for_reviews
Real number (ℝ≥0)
| Distinct count | 954 |
|---|---|
| Unique (%) | 19.0% |
| Missing | 21 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 272.77080844285143 |
|---|---|
| Minimum | 1.0 |
| Maximum | 5060.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 65 |
| median | 156 |
| Q3 | 326 |
| 95-th percentile | 907.8 |
| Maximum | 5060 |
| Range | 5059 |
| Interquartile range (IQR) | 261 |
Descriptive statistics
| Standard deviation | 377.9828856 |
|---|---|
| Coefficient of variation (CV) | 1.385716044 |
| Kurtosis | 26.43829739 |
| Mean | 272.7708084 |
| Mean Absolute Deviation (MAD) | 228.8571855 |
| Skewness | 4.121475159 |
| Sum | 1369855 |
| Variance | 142871.0618 |
| Value | Count | Frequency (%) | |
| 1 | 51 | 1.0% | |
| 3 | 33 | 0.7% | |
| 26 | 32 | 0.6% | |
| 2 | 32 | 0.6% | |
| 10 | 29 | 0.6% | |
| 6 | 28 | 0.6% | |
| 50 | 26 | 0.5% | |
| 32 | 25 | 0.5% | |
| 8 | 25 | 0.5% | |
| 31 | 24 | 0.5% | |
| Other values (944) | 4717 | 93.5% |
| Value | Count | Frequency (%) | |
| 1 | 51 | 1.0% | |
| 2 | 32 | 0.6% | |
| 3 | 33 | 0.7% | |
| 4 | 23 | 0.5% | |
| 5 | 19 | 0.4% |
| Value | Count | Frequency (%) | |
| 5060 | 1 | < 0.1% | |
| 4667 | 1 | < 0.1% | |
| 4144 | 1 | < 0.1% | |
| 3646 | 1 | < 0.1% | |
| 3597 | 1 | < 0.1% |
language
Categorical
| Distinct count | 47 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 12 |
| Missing (%) | 0.2% |
| Memory size | 19.8 KiB |
| English | 4704 |
|---|---|
| French | 73 |
| Spanish | 40 |
| Hindi | 28 |
| Mandarin | 26 |
| Other values (42) | 160 |
| Value | Count | Frequency (%) | |
| English | 4704 | 93.3% | |
| French | 73 | 1.4% | |
| Spanish | 40 | 0.8% | |
| Hindi | 28 | 0.6% | |
| Mandarin | 26 | 0.5% | |
| German | 19 | 0.4% | |
| Japanese | 18 | 0.4% | |
| Italian | 11 | 0.2% | |
| Cantonese | 11 | 0.2% | |
| Russian | 11 | 0.2% | |
| Other values (37) | 90 | 1.8% | |
| (Missing) | 12 | 0.2% |
Length
| Max length | 10 |
|---|---|
| Mean length | 6.971247273 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 23 | 53.5% | |
| Uppercase_Letter | 20 | 46.5% |
| Value | Count | Frequency (%) | |
| Latin | 43 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 43 | 100.0% |
country
Categorical
| Distinct count | 65 |
|---|---|
| Unique (%) | 1.3% |
| Missing | 5 |
| Missing (%) | 0.1% |
| Memory size | 19.8 KiB |
| USA | 3807 |
|---|---|
| UK | 448 |
| France | 154 |
| Canada | 126 |
| Germany | 97 |
| Other values (60) | 406 |
| Value | Count | Frequency (%) | |
| USA | 3807 | 75.5% | |
| UK | 448 | 8.9% | |
| France | 154 | 3.1% | |
| Canada | 126 | 2.5% | |
| Germany | 97 | 1.9% | |
| Australia | 55 | 1.1% | |
| India | 34 | 0.7% | |
| Spain | 33 | 0.7% | |
| China | 30 | 0.6% | |
| Japan | 23 | 0.5% | |
| Other values (55) | 231 | 4.6% |
Length
| Max length | 20 |
|---|---|
| Mean length | 3.488796351 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 24 | 51.1% | |
| Uppercase_Letter | 22 | 46.8% | |
| Space_Separator | 1 | 2.1% |
| Value | Count | Frequency (%) | |
| Latin | 46 | 97.9% | |
| Common | 1 | 2.1% |
| Value | Count | Frequency (%) | |
| ASCII | 47 | 100.0% |
content_rating
Categorical
| Distinct count | 18 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 303 |
| Missing (%) | 6.0% |
| Memory size | 19.8 KiB |
| R | 2118 |
|---|---|
| PG-13 | 1461 |
| PG | 701 |
| Not Rated | 116 |
| G | 112 |
| Other values (13) | 232 |
| Value | Count | Frequency (%) | |
| R | 2118 | 42.0% | |
| PG-13 | 1461 | 29.0% | |
| PG | 701 | 13.9% | |
| Not Rated | 116 | 2.3% | |
| G | 112 | 2.2% | |
| Unrated | 62 | 1.2% | |
| Approved | 55 | 1.1% | |
| TV-14 | 30 | 0.6% | |
| TV-MA | 20 | 0.4% | |
| X | 13 | 0.3% | |
| Other values (8) | 52 | 1.0% | |
| (Missing) | 303 | 6.0% |
Length
| Max length | 9 |
|---|---|
| Mean length | 2.825104105 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 12 | 42.9% | |
| Lowercase_Letter | 10 | 35.7% | |
| Decimal_Number | 4 | 14.3% | |
| Dash_Punctuation | 1 | 3.6% | |
| Space_Separator | 1 | 3.6% |
| Value | Count | Frequency (%) | |
| Latin | 22 | 78.6% | |
| Common | 6 | 21.4% |
| Value | Count | Frequency (%) | |
| ASCII | 28 | 100.0% |
budget
Real number (ℝ≥0)
| Distinct count | 439 |
|---|---|
| Unique (%) | 9.6% |
| Missing | 492 |
| Missing (%) | 9.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39752620.436387606 |
|---|---|
| Minimum | 218.0 |
| Maximum | 12215500000.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 500000 |
| Q1 | 6000000 |
| median | 20000000 |
| Q3 | 45000000 |
| 95-th percentile | 130000000 |
| Maximum | 1.22155e+10 |
| Range | 1.221549978e+10 |
| Interquartile range (IQR) | 39000000 |
Descriptive statistics
| Standard deviation | 206114898.4 |
|---|---|
| Coefficient of variation (CV) | 5.184938658 |
| Kurtosis | 2724.257433 |
| Mean | 39752620.44 |
| Mean Absolute Deviation (MAD) | 37695559.05 |
| Skewness | 48.15743539 |
| Sum | 1.809141756e+11 |
| Variance | 4.248335136e+16 |
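The skewness of 48.15743539 above is the value cited in the budget warning, and the other statistics suggest it is driven by a few extreme values. The sketch below measures how far the maximum sits from the bulk of the data, using only figures from these tables; the variable names are illustrative.

```python
# Values copied from the statistics above.
mean = 39752620.44
std = 206114898.4
p95 = 130000000
maximum = 12215500000

# Distance of the maximum from the mean, in standard deviations.
z_max = (maximum - mean) / std
# Size of the maximum relative to the 95-th percentile.
ratio_to_p95 = maximum / p95

print(f"maximum is {z_max:.1f} standard deviations above the mean")
print(f"maximum is {ratio_to_p95:.0f}x the 95-th percentile")
```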
| Value | Count | Frequency (%) | |
| 20000000 | 174 | 3.5% | |
| 15000000 | 143 | 2.8% | |
| 25000000 | 142 | 2.8% | |
| 30000000 | 141 | 2.8% | |
| 10000000 | 135 | 2.7% | |
| 40000000 | 131 | 2.6% | |
| 35000000 | 120 | 2.4% | |
| 5000000 | 111 | 2.2% | |
| 50000000 | 101 | 2.0% | |
| 60000000 | 92 | 1.8% | |
| Other values (429) | 3261 | 64.7% | |
| (Missing) | 492 | 9.8% |
| Value | Count | Frequency (%) | |
| 218 | 1 | < 0.1% | |
| 1100 | 1 | < 0.1% | |
| 1400 | 1 | < 0.1% | |
| 3250 | 1 | < 0.1% | |
| 4500 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.22155e+10 | 1 | < 0.1% | |
| 4200000000 | 1 | < 0.1% | |
| 2500000000 | 1 | < 0.1% | |
| 2400000000 | 1 | < 0.1% | |
| 2127519898 | 1 | < 0.1% |
title_year
| Distinct count | 91 |
|---|---|
| Unique (%) | 1.8% |
| Missing | 108 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.4705167173252 |
|---|---|
| Minimum | 1916.0 |
| Maximum | 2016.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 1916 |
|---|---|
| 5-th percentile | 1979 |
| Q1 | 1999 |
| median | 2005 |
| Q3 | 2011 |
| 95-th percentile | 2015 |
| Maximum | 2016 |
| Range | 100 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 12.47459892 |
|---|---|
| Coefficient of variation (CV) | 0.006229604289 |
| Kurtosis | 7.439212616 |
| Mean | 2002.470517 |
| Median Absolute Deviation (MAD) | 8.554733481 |
| Skewness | -2.29227335 |
| Sum | 9882192 |
| Variance | 155.6156182 |
| Value | Count | Frequency (%) | |
| 2009 | 260 | 5.2% | |
| 2014 | 252 | 5.0% | |
| 2006 | 239 | 4.7% | |
| 2013 | 237 | 4.7% | |
| 2010 | 230 | 4.6% | |
| 2015 | 226 | 4.5% | |
| 2008 | 225 | 4.5% | |
| 2011 | 225 | 4.5% | |
| 2005 | 221 | 4.4% | |
| 2012 | 221 | 4.4% | |
| Other values (81) | 2599 | 51.5% |
| Value | Count | Frequency (%) | |
| 1916 | 1 | < 0.1% | |
| 1920 | 1 | < 0.1% | |
| 1925 | 1 | < 0.1% | |
| 1927 | 1 | < 0.1% | |
| 1929 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2016 | 106 | 2.1% | |
| 2015 | 226 | 4.5% | |
| 2014 | 252 | 5.0% | |
| 2013 | 237 | 4.7% | |
| 2012 | 221 | 4.4% |
actor_2_facebook_likes
| Distinct count | 917 |
|---|---|
| Unique (%) | 18.2% |
| Missing | 13 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1651.7544731610337 |
|---|---|
| Minimum | 0.0 |
| Maximum | 137000.0 |
| Zeros | 55 |
| Zeros (%) | 1.1% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 281 |
| median | 595 |
| Q3 | 918 |
| 95-th percentile | 11000 |
| Maximum | 137000 |
| Range | 137000 |
| Interquartile range (IQR) | 637 |
Descriptive statistics
| Standard deviation | 4042.438863 |
|---|---|
| Coefficient of variation (CV) | 2.447360627 |
| Kurtosis | 256.7951889 |
| Mean | 1651.754473 |
| Median Absolute Deviation (MAD) | 1979.395883 |
| Skewness | 9.884733179 |
| Sum | 8308325 |
| Variance | 16341311.96 |
| Value | Count | Frequency (%) | |
| 1000 | 309 | 6.1% | |
| 11000 | 111 | 2.2% | |
| 2000 | 100 | 2.0% | |
| 3000 | 76 | 1.5% | |
| 0 | 55 | 1.1% | |
| 10000 | 47 | 0.9% | |
| 14000 | 41 | 0.8% | |
| 13000 | 40 | 0.8% | |
| 826 | 37 | 0.7% | |
| 4000 | 34 | 0.7% | |
| Other values (907) | 4180 | 82.9% |
| Value | Count | Frequency (%) | |
| 0 | 55 | 1.1% | |
| 2 | 14 | 0.3% | |
| 3 | 14 | 0.3% | |
| 4 | 12 | 0.2% | |
| 5 | 10 | 0.2% |
| Value | Count | Frequency (%) | |
| 137000 | 1 | < 0.1% | |
| 29000 | 1 | < 0.1% | |
| 27000 | 2 | < 0.1% | |
| 25000 | 3 | 0.1% | |
| 23000 | 6 | 0.1% |
imdb_score
Real number (ℝ≥0)
| Distinct count | 78 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.442137616498116 |
|---|---|
| Minimum | 1.6 |
| Maximum | 9.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 1.6 |
|---|---|
| 5-th percentile | 4.4 |
| Q1 | 5.8 |
| median | 6.6 |
| Q3 | 7.2 |
| 95-th percentile | 8.09 |
| Maximum | 9.5 |
| Range | 7.9 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 1.125115866 |
|---|---|
| Coefficient of variation (CV) | 0.1746494615 |
| Kurtosis | 0.9356915064 |
| Mean | 6.442137616 |
| Median Absolute Deviation (MAD) | 0.8730186468 |
| Skewness | -0.7414713363 |
| Sum | 32487.7 |
| Variance | 1.265885711 |
| Value | Count | Frequency (%) | |
| 6.7 | 223 | 4.4% | |
| 6.6 | 201 | 4.0% | |
| 7.2 | 195 | 3.9% | |
| 6.5 | 186 | 3.7% | |
| 6.4 | 185 | 3.7% | |
| 7.3 | 184 | 3.6% | |
| 7 | 184 | 3.6% | |
| 7.1 | 181 | 3.6% | |
| 6.8 | 181 | 3.6% | |
| 6.1 | 179 | 3.5% | |
| Other values (68) | 3144 | 62.3% |
| Value | Count | Frequency (%) | |
| 1.6 | 1 | < 0.1% | |
| 1.7 | 1 | < 0.1% | |
| 1.9 | 3 | 0.1% | |
| 2 | 2 | < 0.1% | |
| 2.1 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 9.5 | 1 | < 0.1% | |
| 9.3 | 1 | < 0.1% | |
| 9.2 | 1 | < 0.1% | |
| 9.1 | 3 | 0.1% | |
| 9 | 3 | 0.1% |
aspect_ratio
| Distinct count | 22 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 329 |
| Missing (%) | 6.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.22040305473059 |
|---|---|
| Minimum | 1.18 |
| Maximum | 16.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 1.66 |
| Q1 | 1.85 |
| median | 2.35 |
| Q3 | 2.35 |
| 95-th percentile | 2.35 |
| Maximum | 16 |
| Range | 14.82 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 1.385112535 |
|---|---|
| Coefficient of variation (CV) | 0.6238113087 |
| Kurtosis | 90.65322055 |
| Mean | 2.220403055 |
| Median Absolute Deviation (MAD) | 0.4004107589 |
| Skewness | 9.390056312 |
| Sum | 10466.98 |
| Variance | 1.918536735 |
| Value | Count | Frequency (%) | |
| 2.35 | 2360 | 46.8% | |
| 1.85 | 1906 | 37.8% | |
| 1.78 | 110 | 2.2% | |
| 1.37 | 100 | 2.0% | |
| 1.33 | 68 | 1.3% | |
| 1.66 | 64 | 1.3% | |
| 16 | 45 | 0.9% | |
| 2.2 | 15 | 0.3% | |
| 2.39 | 15 | 0.3% | |
| 4 | 7 | 0.1% | |
| Other values (12) | 24 | 0.5% | |
| (Missing) | 329 | 6.5% |
| Value | Count | Frequency (%) | |
| 1.18 | 1 | < 0.1% | |
| 1.2 | 1 | < 0.1% | |
| 1.33 | 68 | 1.3% | |
| 1.37 | 100 | 2.0% | |
| 1.44 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 16 | 45 | 0.9% | |
| 4 | 7 | 0.1% | |
| 2.76 | 3 | 0.1% | |
| 2.55 | 2 | < 0.1% | |
| 2.4 | 3 | 0.1% |
movie_facebook_likes
| Distinct count | 876 |
|---|---|
| Unique (%) | 17.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7525.9645052548085 |
|---|---|
| Minimum | 0 |
| Maximum | 349000 |
| Zeros | 2181 |
| Zeros (%) | 43.2% |
| Memory size | 39.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 166 |
| Q3 | 3000 |
| 95-th percentile | 40000 |
| Maximum | 349000 |
| Range | 349000 |
| Interquartile range (IQR) | 3000 |
Descriptive statistics
| Standard deviation | 19320.44511 |
|---|---|
| Coefficient of variation (CV) | 2.567171968 |
| Kurtosis | 41.33443692 |
| Mean | 7525.964505 |
| Median Absolute Deviation (MAD) | 11022.02801 |
| Skewness | 5.05892689 |
| Sum | 37953439 |
| Variance | 373279599.2 |
| Value | Count | Frequency (%) | |
| 0 | 2181 | 43.2% | |
| 1000 | 109 | 2.2% | |
| 11000 | 83 | 1.6% | |
| 10000 | 81 | 1.6% | |
| 12000 | 62 | 1.2% | |
| 13000 | 58 | 1.2% | |
| 2000 | 56 | 1.1% | |
| 15000 | 53 | 1.1% | |
| 14000 | 50 | 1.0% | |
| 16000 | 47 | 0.9% | |
| Other values (866) | 2263 | 44.9% |
| Value | Count | Frequency (%) | |
| 0 | 2181 | 43.2% | |
| 2 | 2 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 5 | 0.1% | |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 349000 | 1 | < 0.1% | |
| 199000 | 1 | < 0.1% | |
| 197000 | 1 | < 0.1% | |
| 191000 | 1 | < 0.1% | |
| 190000 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
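A minimal sketch of this calculation in plain Python, using the budget and gross values of the first four rows shown under "First rows". The function name `pearson_r` is illustrative, and the report itself presumably relies on the pandas implementation rather than code like this.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Population covariance and population standard deviations (both divide by n),
    # so the normalisation factors cancel consistently.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)

# budget and gross of rows 0-3 in the "First rows" table
budget = [237000000.0, 300000000.0, 245000000.0, 250000000.0]
gross = [760505847.0, 309404152.0, 200074175.0, 448130642.0]

r = pearson_r(budget, gross)
# Changing location and scale of one variable (here: millions, shifted by 5)
# leaves r unchanged up to floating-point rounding.
r_rescaled = pearson_r([b / 1e6 + 5 for b in budget], gross)
print(r, r_rescaled)
```

Four observations are far too few for a meaningful estimate; the sample only illustrates the arithmetic and the invariance property.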
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
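The sketch below illustrates this definition: both variables are converted to ranks (ties receive the average of their positions, a common convention assumed here) and Pearson's formula is applied to the ranks. The helper names and the toy data are illustrative and not taken from the dataset.

```python
from math import sqrt

def average_ranks(values):
    """1-based ranks; tied values share the average of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values starting at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)  # the 1/n factors cancel

def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data: y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

For this monotonic but nonlinear relationship the rank-based coefficient reaches its maximum, while Pearson's r stays below 1.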
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
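The pair-counting definition can be sketched as follows. This is the plain variant described in the text (often called τ-a); library implementations commonly apply a tie-adjusted variant instead, so results may differ when the data contain ties. The function name and the toy data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
        # sign == 0: tied in x or y, counted in neither group
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data: one swapped pair out of six gives 5 concordant and 1 discordant pair.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

The loop examines every pair once, so the cost grows quadratically with the number of observations; that is acceptable for a sketch but slow for large columns.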
First rows
| | color | director_name | num_critic_for_reviews | duration | director_facebook_likes | actor_3_facebook_likes | actor_2_name | actor_1_facebook_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_facebook_likes | actor_3_name | facenumber_in_poster | plot_keywords | movie_imdb_link | num_user_for_reviews | language | country | content_rating | budget | title_year | actor_2_facebook_likes | imdb_score | aspect_ratio | movie_facebook_likes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Color | James Cameron | 723.0 | 178.0 | 0.0 | 855.0 | Joel David Moore | 1000.0 | 760505847.0 | Action|Adventure|Fantasy|Sci-Fi | CCH Pounder | Avatar | 886204 | 4834 | Wes Studi | 0.0 | avatar|future|marine|native|paraplegic | http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1 | 3054.0 | English | USA | PG-13 | 237000000.0 | 2009.0 | 936.0 | 7.9 | 1.78 | 33000 |
| 1 | Color | Gore Verbinski | 302.0 | 169.0 | 563.0 | 1000.0 | Orlando Bloom | 40000.0 | 309404152.0 | Action|Adventure|Fantasy | Johnny Depp | Pirates of the Caribbean: At World's End | 471220 | 48350 | Jack Davenport | 0.0 | goddess|marriage ceremony|marriage proposal|pirate|singapore | http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1 | 1238.0 | English | USA | PG-13 | 300000000.0 | 2007.0 | 5000.0 | 7.1 | 2.35 | 0 |
| 2 | Color | Sam Mendes | 602.0 | 148.0 | 0.0 | 161.0 | Rory Kinnear | 11000.0 | 200074175.0 | Action|Adventure|Thriller | Christoph Waltz | Spectre | 275868 | 11700 | Stephanie Sigman | 1.0 | bomb|espionage|sequel|spy|terrorist | http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1 | 994.0 | English | UK | PG-13 | 245000000.0 | 2015.0 | 393.0 | 6.8 | 2.35 | 85000 |
| 3 | Color | Christopher Nolan | 813.0 | 164.0 | 22000.0 | 23000.0 | Christian Bale | 27000.0 | 448130642.0 | Action|Thriller | Tom Hardy | The Dark Knight Rises | 1144337 | 106759 | Joseph Gordon-Levitt | 0.0 | deception|imprisonment|lawlessness|police officer|terrorist plot | http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1 | 2701.0 | English | USA | PG-13 | 250000000.0 | 2012.0 | 23000.0 | 8.5 | 2.35 | 164000 |
| 4 | NaN | Doug Walker | NaN | NaN | 131.0 | NaN | Rob Walker | 131.0 | NaN | Documentary | Doug Walker | Star Wars: Episode VII - The Force Awakens | 8 | 143 | NaN | 0.0 | NaN | http://www.imdb.com/title/tt5289954/?ref_=fn_tt_tt_1 | NaN | NaN | NaN | NaN | NaN | NaN | 12.0 | 7.1 | NaN | 0 |
| 5 | Color | Andrew Stanton | 462.0 | 132.0 | 475.0 | 530.0 | Samantha Morton | 640.0 | 73058679.0 | Action|Adventure|Sci-Fi | Daryl Sabara | John Carter | 212204 | 1873 | Polly Walker | 1.0 | alien|american civil war|male nipple|mars|princess | http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1 | 738.0 | English | USA | PG-13 | 263700000.0 | 2012.0 | 632.0 | 6.6 | 2.35 | 24000 |
| 6 | Color | Sam Raimi | 392.0 | 156.0 | 0.0 | 4000.0 | James Franco | 24000.0 | 336530303.0 | Action|Adventure|Romance | J.K. Simmons | Spider-Man 3 | 383056 | 46055 | Kirsten Dunst | 0.0 | sandman|spider man|symbiote|venom|villain | http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1 | 1902.0 | English | USA | PG-13 | 258000000.0 | 2007.0 | 11000.0 | 6.2 | 2.35 | 0 |
| 7 | Color | Nathan Greno | 324.0 | 100.0 | 15.0 | 284.0 | Donna Murphy | 799.0 | 200807262.0 | Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance | Brad Garrett | Tangled | 294810 | 2036 | M.C. Gainey | 1.0 | 17th century|based on fairy tale|disney|flower|tower | http://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1 | 387.0 | English | USA | PG | 260000000.0 | 2010.0 | 553.0 | 7.8 | 1.85 | 29000 |
| 8 | Color | Joss Whedon | 635.0 | 141.0 | 0.0 | 19000.0 | Robert Downey Jr. | 26000.0 | 458991599.0 | Action|Adventure|Sci-Fi | Chris Hemsworth | Avengers: Age of Ultron | 462669 | 92000 | Scarlett Johansson | 4.0 | artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero | http://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_1 | 1117.0 | English | USA | PG-13 | 250000000.0 | 2015.0 | 21000.0 | 7.5 | 2.35 | 118000 |
| 9 | Color | David Yates | 375.0 | 153.0 | 282.0 | 10000.0 | Daniel Radcliffe | 25000.0 | 301956980.0 | Adventure|Family|Fantasy|Mystery | Alan Rickman | Harry Potter and the Half-Blood Prince | 321795 | 58753 | Rupert Grint | 3.0 | blood|book|love|potion|professor | http://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1 | 973.0 | English | UK | PG | 250000000.0 | 2009.0 | 11000.0 | 7.5 | 2.35 | 10000 |
Last rows
| | color | director_name | num_critic_for_reviews | duration | director_facebook_likes | actor_3_facebook_likes | actor_2_name | actor_1_facebook_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_facebook_likes | actor_3_name | facenumber_in_poster | plot_keywords | movie_imdb_link | num_user_for_reviews | language | country | content_rating | budget | title_year | actor_2_facebook_likes | imdb_score | aspect_ratio | movie_facebook_likes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5033 | Color | Shane Carruth | 143.0 | 77.0 | 291.0 | 8.0 | David Sullivan | 291.0 | 424760.0 | Drama|Sci-Fi|Thriller | Shane Carruth | Primer | 72639 | 368 | Casey Gooden | 0.0 | changing the future|independent film|invention|nonlinear timeline|time travel | http://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1 | 371.0 | English | USA | PG-13 | 7000.0 | 2004.0 | 45.0 | 7.0 | 1.85 | 19000 |
| 5034 | Color | Neill Dela Llana | 35.0 | 80.0 | 0.0 | 0.0 | Edgar Tancangco | 0.0 | 70071.0 | Thriller | Ian Gamazon | Cavite | 589 | 0 | Quynn Ton | 0.0 | jihad|mindanao|philippines|security guard|squatter | http://www.imdb.com/title/tt0428303/?ref_=fn_tt_tt_1 | 35.0 | English | Philippines | Not Rated | 7000.0 | 2005.0 | 0.0 | 6.3 | NaN | 74 |
| 5035 | Color | Robert Rodriguez | 56.0 | 81.0 | 0.0 | 6.0 | Peter Marquardt | 121.0 | 2040920.0 | Action|Crime|Drama|Romance|Thriller | Carlos Gallardo | El Mariachi | 52055 | 147 | Consuelo Gómez | 0.0 | assassin|death|guitar|gun|mariachi | http://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1 | 130.0 | Spanish | USA | R | 7000.0 | 1992.0 | 20.0 | 6.9 | 1.37 | 0 |
| 5036 | Color | Anthony Vallone | NaN | 84.0 | 2.0 | 2.0 | John Considine | 45.0 | NaN | Crime|Drama | Richard Jewell | The Mongol King | 36 | 93 | Sara Stepnicka | 0.0 | jewell|mongol|nostradamus|stepnicka|vallone | http://www.imdb.com/title/tt0430371/?ref_=fn_tt_tt_1 | 1.0 | English | USA | PG-13 | 3250.0 | 2005.0 | 44.0 | 7.8 | NaN | 4 |
| 5037 | Color | Edward Burns | 14.0 | 95.0 | 0.0 | 133.0 | Caitlin FitzGerald | 296.0 | 4584.0 | Comedy|Drama | Kerry Bishé | Newlyweds | 1338 | 690 | Daniella Pineda | 1.0 | written and directed by cast member | http://www.imdb.com/title/tt1880418/?ref_=fn_tt_tt_1 | 14.0 | English | USA | Not Rated | 9000.0 | 2011.0 | 205.0 | 6.4 | NaN | 413 |
| 5038 | Color | Scott Smith | 1.0 | 87.0 | 2.0 | 318.0 | Daphne Zuniga | 637.0 | NaN | Comedy|Drama | Eric Mabius | Signed Sealed Delivered | 629 | 2283 | Crystal Lowe | 2.0 | fraud|postal worker|prison|theft|trial | http://www.imdb.com/title/tt3000844/?ref_=fn_tt_tt_1 | 6.0 | English | Canada | NaN | NaN | 2013.0 | 470.0 | 7.7 | NaN | 84 |
| 5039 | Color | NaN | 43.0 | 43.0 | NaN | 319.0 | Valorie Curry | 841.0 | NaN | Crime|Drama|Mystery|Thriller | Natalie Zea | The Following | 73839 | 1753 | Sam Underwood | 1.0 | cult|fbi|hideout|prison escape|serial killer | http://www.imdb.com/title/tt2071645/?ref_=fn_tt_tt_1 | 359.0 | English | USA | TV-14 | NaN | NaN | 593.0 | 7.5 | 16.00 | 32000 |
| 5040 | Color | Benjamin Roberds | 13.0 | 76.0 | 0.0 | 0.0 | Maxwell Moody | 0.0 | NaN | Drama|Horror|Thriller | Eva Boehnke | A Plague So Pleasant | 38 | 0 | David Chandler | 0.0 | NaN | http://www.imdb.com/title/tt2107644/?ref_=fn_tt_tt_1 | 3.0 | English | USA | NaN | 1400.0 | 2013.0 | 0.0 | 6.3 | NaN | 16 |
| 5041 | Color | Daniel Hsia | 14.0 | 100.0 | 0.0 | 489.0 | Daniel Henney | 946.0 | 10443.0 | Comedy|Drama|Romance | Alan Ruck | Shanghai Calling | 1255 | 2386 | Eliza Coupe | 5.0 | NaN | http://www.imdb.com/title/tt2070597/?ref_=fn_tt_tt_1 | 9.0 | English | USA | PG-13 | NaN | 2012.0 | 719.0 | 6.3 | 2.35 | 660 |
| 5042 | Color | Jon Gunn | 43.0 | 90.0 | 16.0 | 16.0 | Brian Herzlinger | 86.0 | 85222.0 | Documentary | John August | My Date with Drew | 4285 | 163 | Jon Gunn | 0.0 | actress name in title|crush|date|four word title|video camera | http://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_1 | 84.0 | English | USA | PG | 1100.0 | 2004.0 | 23.0 | 6.6 | 1.85 | 456 |